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AMENDMENT 



Please amend the application as follows: 
In the specification : 

Please replace the paragraph beginning at page 1, line 4 with the following rewritten 
paragraph: 

-- This application is a part-in-part of U.S. Patent Application Serial No. 09/318,528, 
filed May 25, 1999, which is a continuation-in-part of U.S. Patent Application Serial No. 
09/291,931, now U.S. Patent No. 6,190,897, filed April 13, 1999, which is a continuation of U.S. 
Patent Application Serial No. 09/259,214, now U.S. Patent No. 6,1 10,719, filed March 1, 1999, 
which is a divisional of U.S. Patent Application Serial No. 08/910,798, now U.S. Patent No. 
5,876,997, filed August 13, 1997, all of which are hereby incorporated by reference in their 
entirety. -- _ _ _ _ . __. _ . — ~ 

Please replace the paragraph beginning at page 24, line 29 with the following rewritten 
paragraph: 

- A "comparison window", as used herein, includes reference to a segment of any one of 
the number of contiguous positions selected from the group consisting of from 20 to 600, usually 
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared 
to a reference sequence of the same number of contiguous positions after the two sequences are 
optimally aligned. Methods of alignment of sequence for comparison are w ell-know n in the art. 
Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology 
algorithm of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment 
algorithm of Needleman & Wunsch, J. Mol. Biol 48:443, 1970, by th for similarity method 

of person & Lipman, Proc. Natl Acad. Sci. USA 85:2444, 1988, by co- puterized implementations 
of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software 
Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
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visual inspection. Other algorithms for determining homology or identity include, for example, in 
addition to a BLAST program (Basic Local Alignment Search Tool at the National Center for 
Biological Information), ALIGN, AMAS (Analysis of Multiply Aligned Sequences), AMPS 
(Protein Multiple Sequence Alignment), ASSET (Aligned Segment Statistical Evaluation Tool), 
BANDS, BESTSCOR, BIOSCAN (Biological Sequence Comparative Analysis Node), BLIMPS 
(BLocks IMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, CLUSTAL W, 
CONSENSUS, LCONSENSUS, WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las 
Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool), Framealign, Framesearch, 
DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis Package), GAP (Global Alignment 
Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive Sequence Comparison), LALIGN 
(Local Sequence Alignment), LCP (Local Content Program), MACAW (Multiple Alignment 
Construction & Analysis Workbench), MAP (Multiple Alignment Program), MBLKP, MBLKN, 
PIMA (Pattern-Induced Multi-sequence Alignment), SAGA (Sequence Alignment by Genetic 
Algorithm) and WHAT-IF. Such alignment programs can also be used to screen genome 
databases to identify polynucleotide sequences having substantially identical sequences. A 
number of genome databases are available, for example, a substantial portion of the human 
genome is available as part of the Human Genome Sequencing Project (Gibbs, 1995). At least 
twenty-one other genomes have already been sequenced, including, for example, M. genitalium 
(Fraser et al, 1995), MJannaschii (Bult et al, 1996), H. influenzae (Fleischmann et ai, 1995), E. 
coli (Blattner et al, 1997), and yeast (S. cerevisiae) (Mewes et al, 1997), and D. melanogaster 
(Adams et al, 2000). Significant progress has also been made in sequencing the genomes of model 
organism, such as mouse, C elegans, and Arabadopsis sp. Several databases containing genomic 
information annotated with some functional information are maintained by different organization, 
and are accessible via the internet, for example, The Institute for Genomic Research website; the 
University of Wisconsin Laboratory of Genetics website; the Stanford genome website; the HIV 
database website; the National Center for Biotechnology Information website; the European 
Bioinformatics Institute; the Institut Pasteur website; and the Whitehead Institute/MIT Center for 
Genome Research website. — 
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Please replace the paragraph beginning at page 26, line 7 with the following rewritten 
paragraph: 

-- One example of a useful algorithm is BLAST and BLAST 2.0 algorithms, which are 
described in Altschul et al, Nuc. Acids Res. 25:3389-3402, 1977, and Altschul et al., J. Mol. Biol. 
2J_5:403-410, 1990, respectively. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information website. This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query 
sequence, which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are extended in both directions along 
each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are 
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always >0). For amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction are halted when: the cumulative 
alignment score falls off by the quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine 
the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses 
as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M=5, N— 4 and a comparison of both 
strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and 
expectations (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. 
Acad. Sci. USA 89: 1 09 1 5, 1 989) alignments (B) of 50, expectation (E) of 1 0, M=5, N= -4, and a 
comparison of both strands. - 



